12 research outputs found

    GO Hessian for Expectation-Based Objectives

    Full text link
    An unbiased low-variance gradient estimator, termed GO gradient, was proposed recently for expectation-based objectives EqÎł(y)[f(y)]\mathbb{E}_{q_{\boldsymbol{\gamma}}(\boldsymbol{y})} [f(\boldsymbol{y})], where the random variable (RV) y\boldsymbol{y} may be drawn from a stochastic computation graph with continuous (non-reparameterizable) internal nodes and continuous/discrete leaves. Upgrading the GO gradient, we present for EqÎł(y)[f(y)]\mathbb{E}_{q_{\boldsymbol{\boldsymbol{\gamma}}}(\boldsymbol{y})} [f(\boldsymbol{y})] an unbiased low-variance Hessian estimator, named GO Hessian. Considering practical implementation, we reveal that GO Hessian is easy-to-use with auto-differentiation and Hessian-vector products, enabling efficient cheap exploitation of curvature information over stochastic computation graphs. As representative examples, we present the GO Hessian for non-reparameterizable gamma and negative binomial RVs/nodes. Based on the GO Hessian, we design a new second-order method for EqÎł(y)[f(y)]\mathbb{E}_{q_{\boldsymbol{\boldsymbol{\gamma}}}(\boldsymbol{y})} [f(\boldsymbol{y})], with rigorous experiments conducted to verify its effectiveness and efficiency

    The Poisson Gamma Belief Network

    No full text
    Abstract To infer a multilayer representation of high-dimensional count vectors, we propose the Poisson gamma belief network (PGBN) that factorizes each of its layers into the product of a connection weight matrix and the nonnegative real hidden units of the next layer. The PGBN's hidden layers are jointly trained with an upward-downward Gibbs sampler, each iteration of which upward samples Dirichlet distributed connection weight vectors starting from the first layer (bottom data layer), and then downward samples gamma distributed hidden units starting from the top hidden layer. The gamma-negative binomial process combined with a layer-wise training strategy allows the PGBN to infer the width of each layer given a fixed budget on the width of the first layer. The PGBN with a single hidden layer reduces to Poisson factor analysis. Example results on text analysis illustrate interesting relationships between the width of the first layer and the inferred network structure, and demonstrate that the PGBN, whose hidden units are imposed with correlated gamma priors, can add more layers to increase its performance gains over Poisson factor analysis, given the same limit on the width of the first layer

    Bridging Maximum Likelihood and Adversarial Learning via α-Divergence

    No full text
    Maximum likelihood (ML) and adversarial learning are two popular approaches for training generative models, and from many perspectives these techniques are complementary. ML learning encourages the capture of all data modes, and it is typically characterized by stable training. However, ML learning tends to distribute probability mass diffusely over the data space, e.g., yielding blurry synthetic images. Adversarial learning is well known to synthesize highly realistic natural images, despite practical challenges like mode dropping and delicate training. We propose an α-Bridge to unify the advantages of ML and adversarial learning, enabling the smooth transfer from one to the other via the α-divergence. We reveal that generalizations of the α-Bridge are closely related to approaches developed recently to regularize adversarial learning, providing insights into that prior work, and further understanding of why the α-Bridge performs well in practice

    Compressive Sensing Based Soft Video Broadcast Using Spatial and Temporal Sparsity

    No full text
    Video broadcasting over wireless network has become a very popular application. However, the conventional digital video broadcasting framework can hardly accommodate heterogeneous users with diverse channel conditions, which is called the cliff effects. To overcome this cliff effects and provide a graceful degradation to multi-receivers, in this paper, we use the nonlocal sparsity and hierarchical GOP structure to propose a novel CS based soft video broadcast scheme. CS has properties of minimizing bandwidth consumption and generating measurements with equal importance which are exactly needed by video soft broadcast. In the proposed scheme, the measurement data are generated by block-wise compressive sensing (BCS), and then the measurement data packets are sent over a highly dense constellation though OFDM channel to achieve a simple encoder. Ideally, with the GOP structure, inter frame has lower sampling rate than intra frame to achieve better compression efficiency. At the decoder side, due to equally-important packets and property of soft broadcast, each user can receive the noise-corrupted measurements matching its channel condition and reconstruct video. The hierarchical GOP structure is presented to explode the correlation and non-local sparsity among video frames during the recover process. Additionally, using non-local sparsity, group based CS reconstruction with adaptive dictionaries is proposed to improve decoding quality. The experimental results show that the proposed scheme provides better performance compared with the traditional SoftCast with up to 8 dB coding gain for some channel conditions.SCI(E)[email protected]

    Magnetic Hydroxyapatite-Coated Iron–Chromium Microspheres for Dental Surface Polishing and Plaque Removal

    No full text
    This research aimed to engineer magnetic hydroxyapatite-coated iron–chromium (HAp–FeCr) microspheres to enhance dental surface polishing and plaque elimination. Utilizing a tailored sol–gel approach, the HAp–FeCr microspheres were synthesized and exhaustively characterized via scanning electron microscopy, energy-dispersive X-ray spectroscopy, ζ-potential, X-ray diffractometry, and X-ray photoelectron spectroscopy methodologies. Key findings showcased that these microspheres retained their magnetic properties post-HAp coating, as evidenced by the magnetization curves. An innovative magnetic polishing system was developed, incorporating these microspheres and a 2000 rpm magnet. Comparative evaluations between traditional air-powder polishing and the proposed magnetic technique demonstrated the latter’s superiority. Notably, the magnetic polishing led to a substantial reduction in dental plaque on the tooth surface, decreasing bacterial adhesion and early biofilm formation by Streptococcus gordonii and Lactobacillus acidophilus, where the most pronounced effects were observed in samples with elevated HAp content. A significant 60% reduction in dental plaque was achieved with the magnetic method relative to air-powder polishing. Furthermore, the HAp–FeCr microspheres’ biocompatibility was verified through cytotoxicity tests and animal studies. In essence, the magnetic HAp–FeCr microspheres present a novel and efficient strategy for dental treatments, holding immense potential for improving oral health
    corecore